DESCRIPTION

Attorney Docket No. 38195.66

REMOTE EDUCATION SYSTEM, ATTENDANCE CONFIRMATION METHOD, AND
ATTENDANCE CONFIRMATION PROGRAM



BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to a remote education system
which enables education to be conducted over distances using
bi-directional communication involving multimedia including
video and audio, and more particularly, the present invention
relates to a remote education system which has an attendance
confirmation function for confirming whether a legitimate
student is in attendance, an attendance confirmation method,
and an attendance confirmation program.



2. Description of the Related Art

Recently, with advances in communication technology, remote
education systems have begun to be implemented which use
bi-directional communication based on multimedia, including
video and audio, to allow a student to obtain an education
over distances via communication lines. Such remote education
systems are to be authorized as a valid means of education.
However, to authorize acquisition of units and accreditation
of qualifications, confirmation of student attendance must be
able to be accurately performed.

In particular, in remote education, there are cases where
there is one lecturer for a large number of students, or where
the student attends a programmed unmanned lecture in an
autonomous manner. For these reasons, it is thought that a
student attendance confirmation method that does not rely upon
human resources is required.

A conventional method of confirming the attendance of a
student is an audio-visual authentication method, as disclosed
in PCT Republished International Publication No. WO 00-59226,
for example. In this system, the broadcast program provider
displays a unique viewing confirmation code to the viewer with
unique timing and at a specific time, and in response, the
viewer transmits the viewing confirmation code, which is input
from the audio-visual terminal, to the audio-visual
authentication system together with the input time, and the
audio-visual authentication system then determines whether or
not the viewer actually watched the program based on the
transmitted viewing confirmation code and input time.

However, although such a conventional audio-visual
authentication system uses the input viewing confirmation code
and the time information to determine whether a viewer has
viewed a program, this only allows the system to confirm that
a viewer has viewed the program, but there is no assurance
that the viewer is the legitimate registered student.
Accordingly, if the legitimate student asks a third person to
attend, for example, attendance is recorded as if the
legitimate student were in attendance. Furthermore, when a
student takes an examination using the remote education
system, the legitimate student can easily ask a third person
to take the examination.

On the other hand, conventional methods of confirming
that the student is the legitimate registered student include
a password authentication method in which the student inputs a
pre-registered password, a personal information question
confirmation method in which the student is asked a question
about pre-registered personal information, an image matching
method in which an image is taken of the face or the like of
the student and matched to a pre-registered image of the face
or the like of the student, and an audio matching method in
which audio of the student is captured and matched to
pre-registered audio of the student.

However, in the case of the password authentication and
personal information question confirmation methods, by simply
informing a third person of his or her password or personal
information, the legitimate student can still ask the third
person to attend instead of the legitimate student.

Furthermore, even in the case of the image matching
method and the audio matching method, it is still possible for
a third person to attend instead of the legitimate student, if
the person who receives the attendance confirmation request
sends pre-recorded video or pre-recorded audio of the
legitimate student from the terminal.

Thus, conventional attendance confirmation methods
present a problem in that it is possible for a legitimate
student to intentionally ask a third person to attend instead
of the legitimate student, and it is difficult to verify that
the legitimate student is actually in attendance.

SUMMARY OF THE INVENTION

To overcome the problems described above, preferred
embodiments of the present invention provide a remote
education system which, in the context of remote education,
prevents substitute attendance by a third person using
pre-recorded video or audio, and which enables confirmation
that the legitimate student is in attendance.

A remote education system according to a first preferred
embodiment enables education over distances using multimedia
bi-directional communication, including video and audio, and
includes student video acquisition means for acquiring video
of the student, facial image matching means for matching the
video acquired by the student video acquisition means with a
pre-registered facial image of the student, action request
means for requesting the student to perform an action, and
action detection means for detecting the action performed by
the student on the basis of image changes in the video of the
student acquired by the student video acquisition means,
thereby providing the system with a function for confirming
that the legitimate student is in attendance.

Because video of the student transmitted from the student
terminal is acquired by the student video acquisition means,
and this acquired video of the student is matched with a
pre-registered facial image of the student, it is possible to
confirm that the acquired video is video of the legitimate
student. Furthermore, because the student is requested to
perform an action by the action request means, and the actions
of the student are then detected by the action detection means
on the basis of image changes in the acquired video of the
student, it is possible to detect a substitute attendee,
because the image changes corresponding to the action request
do not occur if the legitimate student asks a third person to
be a substitute attendee using pre-recorded video or the like,
and thus, the attendance of the legitimate student can be
confirmed.
Examples of the actions that the student is requested to
perform by the action request means include any action that
causes changes in the acquired video, and suitable examples
include moving the head, closing the eyes, moving the mouth,
or raising a hand.
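By way of illustration only, detection of an action on the basis of image changes may be sketched as a comparison of a frame acquired before the action request with a frame acquired after it. The sketch assumes grayscale frames held as nested lists; the function names and the threshold value are hypothetical and are not taken from this specification.

```python
from typing import List

Frame = List[List[int]]  # grayscale pixel rows, values 0-255 (assumed format)


def mean_abs_diff(a: Frame, b: Frame) -> float:
    """Average per-pixel absolute difference between two equally sized frames."""
    total = 0
    count = 0
    for row_a, row_b in zip(a, b):
        for pa, pb in zip(row_a, row_b):
            total += abs(pa - pb)
            count += 1
    return total / count if count else 0.0


def action_detected(before: Frame, after: Frame, threshold: float = 10.0) -> bool:
    """Report an action when the image change exceeds a hypothetical threshold."""
    return mean_abs_diff(before, after) > threshold
```

In practice the comparison would likely be restricted to the region of the face or hand concerned, so that background changes are not mistaken for the requested action.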

A remote education system according to a second preferred
embodiment is a remote education system according to the first
preferred embodiment, wherein the action request means
includes a function for requesting that the student move the
position of his or her face, and the action detection means
includes a function for detecting whether the position of the
face of the student has moved.

Because the action request means requests the student to
move the position of his or her face, and the movement of the
student's face position is then detected by the action
detection means on the basis of the image changes in the
acquired video, it is possible to detect a substitute
attendee, because the position of the face does not move as
instructed if the legitimate student asks a third person to be
a substitute attendee using pre-recorded video or the like,
and thus, the attendance of the legitimate student can be
confirmed.
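As a non-limiting sketch, the check that the face has moved as instructed may compare the face position before and after the request. It is assumed here that some face detector (not shown) supplies the face centre in image coordinates with y increasing downward; the function name, the direction labels, and the minimum shift are hypothetical.

```python
from typing import Tuple

Point = Tuple[float, float]  # (x, y) face centre in pixels; y grows downward


def moved_as_requested(before: Point, after: Point, direction: str,
                       min_shift: float = 20.0) -> bool:
    """Return True if the face centre shifted far enough in the requested direction."""
    dx = after[0] - before[0]
    dy = after[1] - before[1]
    # Signed displacement along each requested direction, in image coordinates.
    shifts = {"left": -dx, "right": dx, "up": -dy, "down": dy}
    if direction not in shifts:
        raise ValueError(f"unknown direction: {direction}")
    return shifts[direction] >= min_shift
```

If the student terminal displays a mirrored image, the left and right labels would need to be swapped before the comparison.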

A remote education system according to a third preferred
embodiment is a remote education system according to either
one of the first and second preferred embodiments, wherein the
action request means includes a function for asking the
student a question and requesting an audio response, and the
action detection means includes a function for acquiring audio
of the student, a function for recognizing the audio response
from the acquired audio, a function for determining the
validity of the audio response of the student from the
recognized audio response, and a function for detecting
movement of the mouth of the student accompanying the audio
response by the student.

Because the action request means asks the student a
question and requests an audio response, and the action
detection means then acquires audio of the student, recognizes
the audio response, and determines the validity of the audio
response from the student, while also detecting the movement
of the mouth of the student accompanying the audio response
from image changes in the acquired video, confirmation can be
obtained that the student determined to be the legitimate
student by the facial image matching means has provided the
audio response to the question, and thus, the attendance of
the legitimate student can be confirmed.

A remote education system according to a fourth preferred
embodiment is a remote education system according to the third
preferred embodiment, wherein the action detection means
includes a function for matching the audio response of the
student with pre-registered audio of the student.

Because the audio response of the student is matched to
pre-registered audio of the student by the action detection
means, voice matching of the student is performed in addition
to face matching, which allows the attendance of the
legitimate student to be confirmed even more reliably.

A remote education system according to a fifth preferred
embodiment is a remote education system according to the third
or the fourth preferred embodiment, wherein the action request
means includes a function for asking the student a question
relating to time and requesting an audio response, and the
action detection means includes a function for determining the
validity of the time given in response by the student.

Because the student is asked a question relating to time
by the action request means, and the action detection means
then determines the validity of the time given in response by
the student, while also detecting the movement of the mouth of
the student accompanying the audio response from image changes
in the acquired video, it is possible to detect a substitute
attendee, because a third person cannot provide an audio
response to the question relating to time if the legitimate
student asks the third person to be a substitute attendee
using pre-recorded video or the like, and thus, the attendance
of the legitimate student can be confirmed.

For example, the question relating to time may be a
request for the student to provide an audio response
including the current time, and the validity of the time
provided as an audio response by the student may then be
determined by comparing the current time to the time obtained
by performing speech recognition of the audio response of the
student.
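The comparison of the current time with the recognized time may be sketched as follows, purely as an example. It is assumed that a speech recognizer (not shown) has already produced an hour and a minute on a 24-hour clock; the function name and the two-minute tolerance are hypothetical.

```python
from datetime import datetime


def time_answer_valid(spoken_hour: int, spoken_minute: int, now: datetime,
                      tolerance_minutes: int = 2) -> bool:
    """Accept the spoken time if it lies within the tolerance of the current time."""
    spoken = spoken_hour * 60 + spoken_minute
    actual = now.hour * 60 + now.minute
    diff = abs(spoken - actual)
    # Treat times on either side of midnight as close to each other.
    diff = min(diff, 24 * 60 - diff)
    return diff <= tolerance_minutes
```

A small tolerance allows for clock differences between the student and the system while still rejecting a time recorded in advance.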

A remote education system according to a sixth preferred
embodiment is a remote education system according to any one
of the third through fifth preferred embodiments, wherein the
action request means includes a function for asking a question
relating to personal information of the student and requesting
an audio response, and the action detection means includes a
function for matching the personal information given in
response by the student with pre-registered personal
information relating to that student.

Because the student is asked a question about his or her
personal information by the action request means, and the
action detection means then matches the personal information
given in response by the student with pre-registered personal
information relating to that student, while also detecting the
movement of the mouth of the student accompanying the audio
response from image changes in the acquired video, it is
possible to detect a substitute attendee, because a third
person cannot provide an audio response to the question
relating to personal information if the legitimate student
asks the third person to be a substitute attendee using
pre-recorded video or the like, and thus, the attendance of
the legitimate student can be confirmed.

The personal information of the student requested as an
audio response is preferably selected at random from a
plurality of pre-registered items of personal information. By
doing this, a substitute attendee cannot predict the answer in
advance, and it is possible to confirm that the legitimate
student is in attendance with even greater reliability.
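The random selection of a question item and the matching of the recognized answer may be sketched as follows, by way of example only. The profile dictionary, the function names, and the case-insensitive comparison are assumptions made for illustration.

```python
import random
from typing import Dict, Optional, Tuple


def pick_question(profile: Dict[str, str],
                  rng: Optional[random.Random] = None) -> Tuple[str, str]:
    """Choose one pre-registered item at random; return (item name, expected answer)."""
    rng = rng or random.SystemRandom()  # OS-backed randomness unless a generator is supplied
    item = rng.choice(sorted(profile))
    return item, profile[item]


def answer_matches(expected: str, recognized: str) -> bool:
    """Compare the recognized answer with the registered one, ignoring case and padding."""
    return expected.strip().casefold() == recognized.strip().casefold()
```

A seeded `random.Random` can be passed for repeatable testing, while the default draws from the operating system so that the chosen item is harder to anticipate.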

A remote education system according to a seventh preferred
embodiment is a remote education system according to any one
of the first through sixth preferred embodiments, wherein the
action detection means includes a function for determining
that the facial image of the student does not move in a
discontinuous manner.
According to this preferred embodiment, because it is
possible to detect discontinuous positional changes in the
facial image that occur when a third person attendee switches
the video sent from the terminal to a pre-recorded video in
response to an action request, it is possible to prevent
substitute attendance by a third person with greater
certainty, and confirm the attendance of the legitimate
student with even greater reliability.
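One possible way to test for discontinuous movement, given only as a sketch, is to examine the face position in consecutive frames and flag any jump larger than a person could plausibly make between frames. The face positions are assumed to come from a face detector that is not shown; the function name and the jump limit are hypothetical.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]  # (x, y) face centre in pixels


def has_discontinuity(track: List[Point], max_jump: float = 40.0) -> bool:
    """Return True if the face centre jumps more than max_jump between adjacent frames."""
    for (x0, y0), (x1, y1) in zip(track, track[1:]):
        if math.hypot(x1 - x0, y1 - y0) > max_jump:
            return True
    return False
```

A suitable limit depends on the frame rate and image resolution, so the value shown is only a placeholder.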

A remote education system according to an eighth preferred
embodiment is a remote education system according to any one
of the first through seventh preferred embodiments, wherein
the action detection means has a function for determining
whether image changes occur within a specified time of the
action request means requesting the student to perform an
action.

According to this preferred embodiment, it is possible to
determine non-attendance of the legitimate student if the
required image changes do not occur within a predetermined
time of the action being requested. Accordingly, even assuming
that it were possible to generate video and the like
corresponding to the action request at the terminal, the
substitute attendee can be denied sufficient time to prepare
such video, and it is therefore possible to confirm that the
legitimate student is in attendance with even greater
certainty.
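The time-limit determination may be sketched as follows, as an illustration only. Times are assumed to be expressed in seconds on a common clock; the function name and the five-second limit are hypothetical.

```python
from typing import Optional


def responded_in_time(request_time: float, change_time: Optional[float],
                      limit_seconds: float = 5.0) -> bool:
    """Return True only if the required image change followed the request within the limit."""
    if change_time is None:  # no image change was observed at all
        return False
    elapsed = change_time - request_time
    return 0.0 <= elapsed <= limit_seconds
```

An image change observed before the request is rejected as well, since it cannot be a response to that request.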

A remote education system according to a ninth preferred
embodiment is a remote education system according to any of
the first through eighth preferred embodiments, wherein the
action request means has a function for using unpredictable
timing when requesting the student to perform an action.

According to this preferred embodiment, because the
student cannot predict when an action request will occur, the
student cannot leave the student terminal during a lecture,
which works as an incentive to ensure that the student is
continuously in attendance throughout the entire lecture.
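As one hedged example of unpredictable timing, the moments at which action requests are issued may be drawn at random over the length of the lecture. The function name and the use of whole-second offsets are assumptions for illustration.

```python
import random
from typing import List, Optional


def schedule_requests(lecture_seconds: int, count: int,
                      rng: Optional[random.Random] = None) -> List[int]:
    """Pick `count` distinct second offsets within the lecture, in ascending order."""
    rng = rng or random.SystemRandom()  # OS-backed randomness unless a generator is supplied
    return sorted(rng.sample(range(lecture_seconds), count))
```

Drawing the offsets anew for each lecture prevents a student from learning the schedule from earlier sessions.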
A remote education system according to a tenth preferred
embodiment is a remote education system according to any one
of the first through ninth preferred embodiments, wherein the
action request means has a function for requesting the student
to perform an action based on a command by the lecturer.

According to this preferred embodiment, because the
lecturer can control the timing with which action requests are
issued to students, attendance confirmation can be performed
without adversely impacting the learning of the student.

An attendance confirmation method according to an
eleventh preferred embodiment is a method of confirming
whether a legitimate student is in attendance in a remote
education setting, including the steps of acquiring video of
the student, matching the acquired video with a pre-registered
facial image of the student, requesting the student to perform
an action, and detecting the action by the student on the
basis of image changes in the acquired video.

Because video of the student sent from the student
terminal is acquired during the step of acquiring video of the
student, and the acquired video of the student is matched with
a pre-registered facial image of the student in the step of
matching the pre-registered facial image of the student, it is
possible to confirm that the acquired video is video of the
legitimate student. Furthermore, because the student is
requested to perform an action during the step of requesting
an action, and the actions of the student are then detected in
the step of detecting an action on the basis of image changes
in the acquired video of the student, it is possible to detect
a substitute attendee, because the image changes corresponding
to the action request do not occur if the legitimate student
asks a third person to be a substitute attendee using
pre-recorded video or the like, and thus, the attendance of
the legitimate student can be confirmed.

Examples of the action requested of the student in the
step of requesting an action include any action that causes
changes in the acquired video, and suitable examples include
moving the head, closing the eyes, moving the mouth, or
raising a hand.



An attendance confirmation method according to a twelfth
preferred embodiment is an attendance confirmation method
according to the eleventh preferred embodiment, wherein the
step of requesting an action includes a step of requesting
that the student move the position of his or her face, and the
step of detecting the action includes a step of detecting
whether the position of the face of the student has moved.

Because the student is requested to move his or her face
during the step of requesting an action, and the movement of
the student's face position is then detected in the step of
detecting the action on the basis of image changes in the
acquired video, it is possible to detect a substitute
attendee, because the position of the face will not move as
instructed if the legitimate student asks a third person to be
a substitute attendee using pre-recorded video or the like,
and thus, the attendance of the legitimate student can be
confirmed.

An attendance confirmation method according to a
thirteenth preferred embodiment is an attendance confirmation
method according to either one of the eleventh and twelfth
preferred embodiments, wherein the step of requesting an
action includes a step of asking the student a question and
requesting an audio response, and the step of detecting the
action includes a step of acquiring audio of the student, a
step of performing recognition of the audio response from the
acquired audio, a step of determining the validity of the
audio response of the student from the recognized audio
response, and a step of detecting movement of the mouth of the
student accompanying the audio response of the student.

Because the student is asked a question and requested to
provide an audio response during the step of requesting an
action, and then during the subsequent step of detecting the
action, audio of the student is acquired, the audio response
is subjected to recognition, and the validity of that audio
response is determined, while movement of the mouth of the
student accompanying the audio response is also detected from
image changes in the acquired video, confirmation can be
obtained that the student determined to be the legitimate
student in the step of matching the pre-registered facial
image of the student has provided the audio response to the
question, and thus, the attendance of the legitimate student
can be confirmed.

An attendance confirmation method according to a
fourteenth preferred embodiment is an attendance confirmation
method according to the thirteenth preferred embodiment,
wherein the step of detecting the action includes a step of
matching the audio response of the student with pre-registered
audio of the student.



Because the audio response of the student is matched with
pre-registered audio of the student in the step of detecting
the action, voice matching of the student is performed in
addition to facial image matching, and the attendance of the
legitimate student can be confirmed with even greater
reliability.

An attendance confirmation method according to a
fifteenth preferred embodiment is an attendance confirmation
method according to either one of the thirteenth and
fourteenth preferred embodiments, wherein the step of
requesting an action includes a step of asking the student a
question relating to time and requesting an audio response,
and the step of detecting the action includes a step of
determining the validity of the time given in response by the
student.

Because the student is asked a question relating to time
in the step of requesting an action, and in the subsequent
step of detecting the action, the validity of the time given
in response by the student is determined, while detecting the
movement of the mouth of the student accompanying the audio
response on the basis of image changes in the acquired video,
it is possible to detect a substitute attendee, because the
third person cannot provide a requested response to the
question relating to time if the legitimate student asks a
third person to be a substitute attendee using pre-recorded
video or the like, and thus, the attendance of the legitimate
student can be confirmed.

For example, the question relating to time may be a
request for the student to provide an audio response
including the current time, and the validity of the time
provided as an audio response by the student may be determined
by comparing the current time to the time obtained by
performing speech recognition of the audio response of the
student.

An attendance confirmation method according to a
sixteenth preferred embodiment is an attendance confirmation
method according to any one of the thirteenth through
fifteenth preferred embodiments, wherein the step of
requesting an action includes a step of asking a question
relating to personal information of the student and requesting
an audio response, and the step of detecting the action
includes a step of matching the personal information given in
response by the student with pre-registered personal
information relating to that student.

The student is asked a question about his or her personal
information in the step of requesting an action, and in the
subsequent step of detecting the action, the personal
information given in response by the student is matched with
pre-registered personal information relating to that student,
and the movement of the mouth of the student accompanying the
audio response is detected on the basis of image changes in
the acquired video. Consequently, it is possible to detect a
substitute attendee, because the third person cannot provide
an audio response to the question relating to personal
information if the legitimate student asks a third person to
be a substitute attendee using pre-recorded video or the like,
and thus, the attendance of the legitimate student can be
confirmed.

The personal information of the student requested as an
audio response is preferably selected at random from a
plurality of pre-registered items of personal information. By
doing this, a substitute attendee cannot predict the answer in
advance, and it is possible to confirm that the legitimate
student is in attendance with even greater reliability.

An attendance confirmation method according to a
seventeenth preferred embodiment is an attendance confirmation
method according to any one of the eleventh through sixteenth
preferred embodiments, wherein the step of detecting the
action includes a step of determining that the facial image of
the student does not move in a discontinuous manner.

According to this preferred embodiment, because it is
possible to detect discontinuous positional changes in the
facial image that occur when a third person attendee switches
the video sent from the terminal to a pre-recorded video in
response to an action request, it is possible to prevent
substitute attendance by a third person with greater
certainty, and confirm the attendance of the legitimate
student with even greater reliability.

An attendance confirmation method according to an
eighteenth preferred embodiment is an attendance confirmation
method according to any one of the eleventh through
seventeenth preferred embodiments, wherein the step of
detecting the action includes a step of determining whether
image changes occur within a specified time of the student
being requested to perform an action in the step of requesting
an action.

According to this preferred embodiment, it is possible to
determine non-attendance of the legitimate student if the
required image changes do not occur within a predetermined
time of the action being requested. Accordingly, even assuming
that it were possible to generate video and the like
corresponding to the action request at the terminal, the
substitute attendee can be denied sufficient time to prepare
such video, and it is therefore possible to confirm that the
legitimate student is in attendance with even greater
certainty.

An attendance confirmation method according to a
nineteenth preferred embodiment is an attendance confirmation
method according to any one of the eleventh through eighteenth
preferred embodiments, wherein the step of requesting an
action uses unpredictable timing when requesting the student
to perform an action.

According to this preferred embodiment, because the
student cannot predict when an action request will occur, the
student cannot leave the student terminal during a lecture,
which provides an incentive to ensure that the student is
continuously in attendance throughout the entire lecture.
An attendance confirmation method according to a
twentieth preferred embodiment is an attendance confirmation
method according to any one of the eleventh through nineteenth
preferred embodiments, wherein the step of requesting an
action requests the student to perform an action based on a
command by the lecturer.

According to this preferred embodiment, because the
lecturer can control the timing with which action requests are
issued to students, attendance confirmation can be performed
without adversely impacting the learning of the student.

An attendance confirmation program according to a
twenty-first aspect is a program for executing, on a computer,
an attendance confirmation method according to any one of the
eleventh through twentieth preferred embodiments.

By equipping a computer system with a program for
executing each of the steps of an attendance confirmation
method according to any one of the eleventh through twentieth
preferred embodiments, it is possible to prevent substitute
attendance by a third person and confirm that the legitimate
students are in attendance in a remote education setting.
These and other elements, steps, characteristics,
features and advantages of the invention will be more readily
apparent from the following detailed description of preferred
embodiments of the present invention taken in conjunction with
the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

Fig. 1 is a system configuration diagram of a remote
education system according to a preferred embodiment of the
present invention;

Fig. 2 is a diagram showing an example of a screen
displayed on a student terminal by a remote education system
according to a preferred embodiment of the present invention;

Fig. 3 is a flowchart (1 of 3) of an attendance
confirmation program of a remote education system according to
a preferred embodiment of the present invention;

Fig. 4 is a flowchart (2 of 3) of the attendance
confirmation program of the remote education system according
to the preferred embodiment of the present invention; and

Fig. 5 is a flowchart (3 of 3) of the attendance
confirmation program of the remote education system according
to the preferred embodiment of the present invention.



DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

Fig. 1 is a system configuration diagram of a remote
education system according to a preferred embodiment of the
present invention. In the diagram, the reference numeral 1
indicates a remote education system. The system 1 includes a
Web server 10, a bidirectional communication server 20, a
content distribution server 30, an operations management
system 40, a lecturer terminal 60 which is used by a lecturer,
and a communication line connecting device 70, which are all
connected together by a network. The remote education system
is connected to student terminals 101, 102, and so on, used by
students, via a communication line 2 when remote education
services are provided.

The Web server 10 is a WWW server which acts as a
mediator when using the present system, and tasks such as the
setting and progression of lectures, as well as
attendance-related procedures, can be performed by using a web
browser to access a predetermined page from either the
lecturer terminal 60, or the student terminals 101, 102, and
so on.

The bidirectional communication server 20 includes a
connection destination table for storing the terminal
addresses of terminals that are connected to the system, and
connects to terminals that are registered in this connection
destination table, providing a bidirectional communication
function based on multimedia information including video and
audio. During a lecture, the terminal addresses of the
lecturer terminal 60 and the student terminals 101, 102, and
so on, used by the students who are attending the lecture are
registered in the connection destination table. The
bidirectional communication server 20 then receives video and
audio sent from the lecturer terminal 60, as well as video and
audio sent from the student terminals 101, 102, and so on,
integrates the received video and audio, and transmits the
result to the lecturer terminal 60 and the student terminals
101, 102, and so on.

The content distribution server 30 includes an educational
material database 32 for storing multimedia educational
material including moving images, still images, audio, text,
and sound effects. The server 30 provides a function for
distributing specified content to the terminals registered in
the connection destination table. During a lecture, the
terminal addresses of the lecturer terminal 60 and the student
terminals 101, 102, and so on, used by the students who are
attending the lecture are registered in the connection
destination table, and the educational material content
specified by the lecturer terminal 60 is distributed to each
terminal.

The operations management system 40 is a system for
managing the remote education operations, and includes a
lecture information database 50 for storing a variety of data
relating to the lectures to be provided, and performs data
processing such as lecture establishment processing 41 for
establishing lectures, attendance registration processing 42
for receiving attendance for the established lectures, lecture
opening processing 43 for opening the established lectures,
and attendance confirmation processing 44 for confirming
student attendance of the opened lectures.

The lecturer terminal 60 includes a television camera (a)
for acquiring video of the lecturer, a headset (b) for
inputting and outputting audio to and from the lecturer, a
monitor screen (c) for displaying images, a keyboard (d) and a
mouse (e) for inputting commands and messages, and a computer
(f), which includes a video input interface, an audio
input/output interface, a network connection device, and the
like. The following programs are preferably installed on the
computer: a video and audio communication program, which
acquires video and audio of the lecturer and transmits this
information to the bidirectional communication server 20, and
also outputs video and audio sent from the bidirectional
communication server 20 to the monitor screen and the headset,
respectively, a content display program for outputting images
and audio distributed from the content distribution server 30
to the monitor screen and the headset, and a web browser for
accessing the Web server 10.

The communication line 2 may be adapted to any type of
communication line including a wired computer network, a
wireless computer network, a telephone line, a satellite line,
and the Internet, although in the following description, the
Internet is used as an example. Accordingly, the communication
line connecting device 70 uses an Internet router for
connecting to the Internet.

The student terminals 101, 102, and so on, are similar to
the lecturer terminal 60, and each include a television camera
(a) for acquiring video of the student, a headset (b) for
inputting and outputting audio to and from the student, a
monitor screen (c) for displaying images, a keyboard (d) and a
mouse (e) for inputting commands and messages, and a computer
(f), which includes a video input interface, an audio
input/output interface, a network connection device, and the
like. The following programs are installed on the computer: a
video and audio communication program, which acquires video
and audio of the student and transmits this information to the
bidirectional communication server 20, and also outputs video
and audio sent from the bidirectional communication server 20
to the monitor screen and the headset, respectively, a content
display program for outputting images and audio distributed
from the content distribution server 30 to the monitor screen
and the headset, and a web browser for accessing the Web
server 10.

Next, the steps of implementing remote education using the
remote education system 1 according to a preferred embodiment
of the present invention are described.

First, the administration office presenting the remote
education accesses the page for the office on the Web server
10 from either the lecturer terminal 60 or a dedicated office
terminal (not shown in the figure), and then decides and
inputs the lectures which are to be established, based on the
academic calendar and the corresponding lecturer schedules. As
a result, a lecture schedule database 51 is constructed in the
lecture information database 50 by the lecture establishment
processing 41 of the operations management system 40, and an
attendance record is generated for each lecture and registered
in an attendance record database 52.

The administration office then begins student recruitment,
and issues an ID and password for accessing the Web server 10
to each student who satisfies a predetermined standard for
enrollment. Each student issued with an ID and password
accesses the student page on the Web server 10 from his or her
student terminal and registers his or her personal
information, registers a facial image used to match facial
images when confirming attendance, and registers an audio file
that is used to match audio responses to questions relating to
personal information when confirming attendance.

During the registration of personal information, the
attendance registration processing 42 of the operations
management system 40 registers various information, such as,
for example, the name, age, gender, date of birth, address,
telephone number, e-mail address, and other useful information
input by each student in the personal information database 53
of the lecture information database 50, together with the ID
and password for that student, and the terminal address of
each student terminal. Furthermore, a determination as to
whether the student who wants to attend the lecture can attend
the lecture or not is made by referring to the lecture
schedule database 51, and if attendance is possible, then the
student in question is registered in the attendance record of
the relevant lecture in the attendance record database 52.

During the facial image registration, the attendance
registration processing 42 of the operations management system
40 sets the terminal address of the relevant student terminal
and the address of the operations management system in the
connection destination table of the bidirectional
communication server 20, and records a frame of video as a
still image at the time when the student clicks a facial image
registration button (not shown in the figure) provided on the
web page, and then registers this still image in the facial
image database 54 of the lecture information database 50.

During the audio registration, the attendance registration
processing 42 of the operations management system 40 sets the
terminal address of the relevant student terminal and the
address of the operations management system in the connection
destination table of the bidirectional communication server
20, and acquires audio from the student when he or she clicks
the audio registration button (not shown) provided for each
item of personal information on the Web page, and then
registers the acquired audio in the audio database 55 of the
lecture information database 50 according to each item of
personal information.

When the date and time of the lecture set in the lecture
schedule database 51 arrives, the lecture is opened in the
bidirectional communication server 20 and the content
distribution server 30 by the lecture opening processing 43.

When the students of the lecture access the attendance
reception page from the student terminals 101, 102, and so on,
and check in using their predetermined IDs and passwords, the
terminal addresses of the relevant student terminals are set
in the connection destination table of the bidirectional
communication server 20 and the distribution destination table
of the content distribution server 30.

Furthermore, when the lecturer of the relevant lecture
accesses the lecturer reception page on the Web server 10 from
the lecturer terminal 60, the terminal address of the lecturer
terminal 60 is set in the connection destination table of the
bidirectional communication server 20 and the distribution
destination table of the content distribution server 30.

The attendance confirmation described below is performed
by the attendance confirmation processing 44 of the operations
management system 40 for the students who have checked in, and
if the student is confirmed as a legitimate student, then the
time of attendance is recorded in the attendance record.

In this manner, the lecture begins, and the lecturer
conducts the lecture from the lecturer terminal 60,
distributing video and audio of the lecturer to the student
terminals. Furthermore, the lecturer can control the content
distribution server 30 from the lecturer terminal 60, via the
Web server 10, and by searching the educational material
database 32 for the necessary educational material and then
playing back the material, the relevant educational material
is distributed to the students.

Fig. 2 shows an example of the screen which appears on
the monitor screen of the student terminals 101, 102, and so
on, during the lecture. In Fig. 2, symbol A indicates the
lecturer screen for displaying video of the lecturer sent from
the lecturer terminal 60, and symbol B indicates the student
screen for displaying video of the students sent from the
student terminals 101, 102, and so on, together with their
names. The reason that the names of the students are displayed
in the student screen B is so that the lecturer can request a
comment or a response from a specific student using his or her
name.

Symbol C indicates the content screen for displaying
content distributed from the content distribution server 30.
A pointer can be displayed on the content screen C by an
operation from the lecturer terminal 60, and this pointer can
be used to indicate points of interest to the students when
describing the contents of the educational material being
presented.

Symbol D indicates an input screen for receiving input
when the student makes a comment, responds to a question asked
by the lecturer, or leaves in midcourse and exits the lecture,
and symbol E indicates a message screen for displaying
messages to the students from the lecturer terminal and the
operations management system. The messages issued to the
student when the attendance confirmation processing 44
confirms the attendance of the student are displayed in this
message screen E.

The display on the monitor screen of the lecturer
terminal 60 is the same as that of the monitor screen of the
student terminals, with the exception that the command input
screen D also includes buttons for controlling the search for,
and reproduction of, content distributed by the content
distribution server 30, as well as control buttons for the
pointer displayed in the content screen C.

During the lecture, students may ask questions or make
comments to the lecturer by clicking the speak button. When a
student clicks the speak button, the student in question
flickers in the student screen B, and the lecturer can accept
the comment by clicking the display of that student from the
lecturer terminal. At this time, the video of the relevant
student is scaled up for distribution to the lecturer terminal
and to each student terminal, and the audio of the relevant
student is also distributed to the lecturer terminal and the
other student terminals. As a result, the relevant student can
converse with the lecturer using video and audio, and the
contents of the conversation are also transmitted to the other
students.

Furthermore, during the lecture, the lecturer can also
set questions for all of the students or a specific student to
answer. When a student or students specified by the lecturer
either click a selection button or input a message into the
message box and click the respond button, the contents of the
response are transmitted to the operations management system
40 via the Web server 10, and the aggregated results are
displayed on the lecturer terminal 60. Furthermore, the
lecturer can also display those aggregated results to the
students, if necessary, such that the aggregated results are
distributed to the student terminals via the content
distribution server 30 on an instruction from the lecturer
terminal 60.

During the lecture, attendance confirmation is performed
as needed by the attendance confirmation processing 44 of the
operations management system 40 in the manner described below,
thus enabling confirmation that the legitimate students are
in attendance.

When a student needs to leave in midcourse for a toilet
break or the like, he or she clicks the leave seat button. In
this case, the image of the relevant student in the student
screen B changes to an away mark, and the time when he or she
left is recorded in the attendance record. Furthermore, when
the student returns, he or she clicks the take seat button. As
a result, the away mark of the relevant student in the student
screen B returns to normal, and the time when he or she
returned is recorded in the attendance record. Furthermore,
each student clicks the exit button at the end of the lecture
or if he or she wishes to exit the lecture early. As a result,
the time when the student exits is recorded in the attendance
record, and the terminal address of the relevant student is
deleted from the connection destination table and the
distribution destination table, thereby terminating the
attendance of the student.

In this manner, the lecturer can accurately ascertain the
attendance status, and because the arrival time, exit time,
leave times, and return times are recorded automatically in
the attendance record, it is possible to ascertain the actual
attendance time for the lecture in question. In particular, in
the remote education system 1 according to this preferred
embodiment, because attendance confirmation for each student
may be performed at any time by the attendance confirmation
processing 44, it is expected that students will be very
careful about managing their attendance time at lectures, to
avoid being marked as absent.
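
As an illustration only, the actual attendance time could be
derived from the recorded arrival, leave, return, and exit
times roughly as follows. This is a minimal sketch and not part
of the described system: the event labels "arrive", "leave",
"return", and "exit", the function name attended_time, and the
list-of-tuples record layout are all assumptions made for the
example.

```python
from datetime import datetime, timedelta

def attended_time(events):
    """Sum the time during which a student was present.

    events is a chronological list of (kind, timestamp) tuples, where
    kind is one of the hypothetical labels "arrive", "leave",
    "return", or "exit".
    """
    total = timedelta()
    present_since = None  # start of the current presence interval
    for kind, timestamp in events:
        if kind in ("arrive", "return"):
            if present_since is None:
                present_since = timestamp
        elif kind in ("leave", "exit"):
            if present_since is not None:
                total += timestamp - present_since
                present_since = None
        else:
            raise ValueError(f"unknown event kind: {kind}")
    return total

if __name__ == "__main__":
    day = lambda h, m: datetime(2024, 1, 1, h, m)
    record = [
        ("arrive", day(9, 0)),
        ("leave", day(9, 30)),
        ("return", day(9, 40)),
        ("exit", day(10, 30)),
    ]
    print(attended_time(record))
```

An interval that is still open when the record ends (no exit
recorded) is deliberately not counted in this sketch; how such a
case should be treated is a policy decision the description
leaves open.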

In addition, the bidirectional communication server 20
includes an application sharing function which can run a
common application between the lecturer terminal and the
student terminals and exchange information on the operations
performed at each terminal between the terminals, thereby
sharing the application between all participants. Consequently,
it is possible to implement a lecture in the form of a seminar
in which a common whiteboard on which text and graphics can be
drawn is displayed on the screen of each terminal, and the
discussion progresses with the lecturer and the students
adding text and graphics freely to the whiteboard.



Furthermore, a function is also provided whereby a chat 
screen is displayed on each terminal, and text-based 
conversation or discussion takes place between the lecturer 
and the students. 

In this manner, according to the remote education system
1 of this preferred embodiment, each student can receive video
and audio of the lecturer from the bidirectional communication
server 20, while receiving multimedia educational material
distributed from the content distribution server 30, and the
lecturer can set questions for the students and request
responses, or accept questions or comments from students and
establish a conversation based on audio and video using the
bidirectional communication server 20. Consequently, the same
level of teaching effectiveness can be obtained as if the
students were given a lesson in a lecture room using common
educational materials, even though each student is in a
distant location.

In the preferred embodiment described above, video of
each student is preferably displayed in the student screen B,
but it is also possible to use an icon based display, in which
a still image is displayed except when the student asks a
question or makes a comment to the lecturer. As a result, even
if the communication line has limited bandwidth, remote
education can still be implemented without placing a large
burden on the communication lines.



Furthermore, in the preferred embodiment described above,
the name of each student is displayed in the student screen B.
However, it is also possible to display only video or an icon
of each student in the student screen B, and to provide the
attendance record corresponding to the student screen to the
lecturer terminal separately.

In the preferred embodiment described above, the
description focused on the case of a manned lecture, where the
lecture is performed live by a lecturer, but it is also
possible to set up unmanned lectures, which are performed
using a lecture video. In this case, the lecture video is
created in advance by the administration office, and is
registered in the educational material database 32 of the
content distribution server 30, and the lecture video to be
used is set in the lecture schedule database 51. When the date
and time of the lecture arrives, the lecture opening
processing 43 of the operations management system 40 specifies
the lecture video for distribution by the content distribution
server 30, and starts the lecture automatically. Appropriate
questions are set within the lecture video, and the responses
of the students to these questions are collected and recorded
automatically. Furthermore, if the students have questions or
the like, they can send them to the administration office by
e-mail, where they are forwarded to the lecturer concerned and
answered at a later time.

Next, the attendance confirmation processing 44 of the
operations management system 40 is described in detail.

The attendance confirmation processing 44 includes a
student image acquisition processing 45 for acquiring an image
of each student transmitted from each student terminal, a
facial image matching processing 46 for matching the acquired
image of each student with the facial image of that student in
the facial image database 54, an action request processing 47
for requesting, of each student who passes facial image
matching, an action accompanying changes in the image
transmitted from the student terminal, and an action detection
processing 48 for detecting changes in the image transmitted
from the student terminals in response to the requested
action. In the present preferred embodiment, three kinds of
actions, for example, are preferably requested of each
student, such as moving the head position as instructed,
giving the current time verbally, and answering a question
relating to personal information verbally.

Fig. 3 through Fig. 5 show the processing flow of the
attendance confirmation processing 44.

When the attendance confirmation time arrives (S100), an
attendance confirmation request is sent to the lecturer
terminal 60 (S102), and when the lecturer acknowledges the
attendance confirmation request, attendance confirmation is
performed (S104). The start of the lecture and a random time
during the lecture are chosen as attendance confirmation
times. Performing attendance confirmation at a random time
during the lecture ensures that the students cannot predict
when attendance confirmation will occur, which provides an
incentive to ensure that the students are continuously in
attendance throughout the entire lecture. However, because it
may be disruptive to perform attendance confirmation while the
lecturer is in the middle of performing the lecture, the
attendance confirmation request is first sent to the lecturer,
and attendance confirmation is performed only after the
lecturer suspends the lecture and clicks the acknowledge
button for the attendance confirmation request. In the case of
an unmanned lecture using a lecture video, attendance
confirmation may be performed, for example, by registering
points where interruption is possible in the lecture video,
and pausing playback of the lecture video at the first point
where interruption is possible after the attendance
confirmation request is issued.

If the attendance confirmation time has not yet been
reached, the flow proceeds to S406 and repeats until the
lecture ends.
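
By way of a hedged example, the choice of attendance
confirmation times (the start of the lecture plus one
unpredictable time during it) might be sketched as below. The
function name confirmation_times, the use of whole-minute
offsets, and the uniform random draw are assumptions made for
the illustration; the description does not specify how the
random time is generated.

```python
import random

def confirmation_times(lecture_minutes, rng=None):
    """Return confirmation offsets in minutes from the lecture start.

    The first offset is always 0 (start of the lecture). The second is
    drawn uniformly from the interior of the lecture so that students
    cannot predict it. A seeded random.Random may be passed for tests.
    """
    if lecture_minutes < 2:
        raise ValueError("lecture too short for a mid-lecture check")
    if rng is None:
        rng = random.Random()
    random_offset = rng.randint(1, lecture_minutes - 1)
    return [0, random_offset]

if __name__ == "__main__":
    print(confirmation_times(90))
```

In a live lecture the drawn time would only trigger the request
to the lecturer; the confirmation itself would still wait for
the lecturer's acknowledgement, as described above.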

Next, the attendance record for the relevant lecture is
acquired from the attendance record database 52 (S106), the
terminal addresses of the student terminals of the students
registered in the attendance record are sent to the
bidirectional communication server 20, video and audio of the
relevant students are acquired (S108), and a notification of
facial image confirmation is sent to the relevant student
terminals (S110). For example, the message "We will now
confirm your attendance. Please present your face to the
camera as you did when registering attendance" is displayed in
the message screen. At this time, it is possible to add the
message "click here when ready" such that the Web server 10
detects when "here" is clicked, enabling the flow to proceed
to the next step.

Facial image data for each student is then retrieved from
the personal information database 53 (S112), and a still image
of the student, extracted from the video that is currently
being acquired, is matched with the facial image data of that
student (S114). In image matching, direct matching processing
of the acquired video of the student is performed using the
facial image data of the relevant student as a template, to
determine the point with the highest degree of correlation. It
is also possible to perform matching processing after
normalizing the image size of the still image extracted from
the acquired video of the student using a feature extraction
calculation, which determines the size of the outside contour
of the face, or the position or shape of the eyes, nose, or
mouth, and it is also possible to compare the results of
feature extraction calculation for the facial image data of
the relevant student.

As a result of the image matching, if a predetermined
degree of correlation is obtained, then the flow proceeds to
the next step (S116), whereas if a predetermined degree of
correlation is not obtained, a confirmation message is sent to
the relevant student terminal (S118). If there is a response
to the message, the flow returns to step S110 and the facial
image confirmation request is repeated (S120), whereas if
there is no response, the flow proceeds to S408 and the
student is marked as absent in the attendance record.
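
The template matching and correlation threshold described above
can be illustrated with a small sketch. It assumes grayscale
images held as 2D lists of numbers and uses normalized
cross-correlation as the degree of correlation; the description
does not name a specific correlation measure, so this choice,
the function names ncc, best_match, and passes_matching, and the
threshold value 0.8 are all assumptions for the example.

```python
import math

def ncc(patch, template):
    """Normalized cross-correlation of two equal-size 2D lists."""
    a = [v for row in patch for v in row]
    b = [v for row in template for v in row]
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(sum((x - mean_a) ** 2 for x in a)
                    * sum((y - mean_b) ** 2 for y in b))
    return num / den if den else 0.0  # flat patches carry no signal

def best_match(image, template):
    """Slide the template over the image; return (score, (row, col))
    for the position with the highest degree of correlation."""
    th, tw = len(template), len(template[0])
    best = (-2.0, (0, 0))  # below the minimum possible score of -1
    for r in range(len(image) - th + 1):
        for c in range(len(image[0]) - tw + 1):
            patch = [row[c:c + tw] for row in image[r:r + th]]
            score = ncc(patch, template)
            if score > best[0]:
                best = (score, (r, c))
    return best

def passes_matching(image, template, threshold=0.8):
    """True if the best correlation reaches the (assumed) threshold."""
    return best_match(image, template)[0] >= threshold

if __name__ == "__main__":
    template = [[1, 9], [9, 1]]
    image = [[0, 0, 0, 0], [0, 0, 1, 9], [0, 0, 9, 1], [0, 0, 0, 0]]
    print(best_match(image, template))
```

The position returned by best_match corresponds to the point
with the highest degree of correlation, which the facial
movement confirmation then uses as the position of the face.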

Next, facial movement confirmation is performed. First,
the initial position of the face of the student in the
extracted still image is detected (S122). The position of the
face is determined as the point where the degree of
correlation is highest for the matching processing performed
in the facial matching described above.

A facial movement request is then sent to the relevant
student terminals (S124). For example, a message such as
"Please move your face approximately 10 cm to the right" is
displayed in the message screen. The direction in which the
face is to be moved and the movement distance can be changed
at random, as appropriate. As a result, it is almost
impossible to achieve image confirmation using video that has
been prepared in advance, which prevents substitute attendance
by a third person.

Here, facial position detection of each student is
performed in a continuous manner on the acquired video from
the student, using the image matching described above (S126),
and when a change in the position of the face that corresponds
to the above message is detected, the flow proceeds to the
next step (S128). Furthermore, if a change in the position of
the face is not detected within a predetermined time (S130), a
confirmation message is sent to the relevant student terminal
(S132). If there is a response to the message, the flow
returns to step S120 and the facial movement request is
repeated (S134), whereas if there is no response, the flow
proceeds to S408 and the relevant student is marked as absent
in the attendance record.
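
The randomized movement request and the check that the detected
change in face position corresponds to it might be sketched as
follows. Everything named here is hypothetical: the functions
make_request and movement_matches, the candidate distances, the
pixels-per-centimetre scale, the 50 % tolerance, and the
assumption that "right" corresponds to increasing horizontal
pixel position (a mirrored camera image would reverse the
sign).

```python
import random

# Sign of the expected horizontal displacement for each direction
# (assumes an unmirrored image where "right" increases x).
DIRECTIONS = {"left": -1, "right": 1}

def make_request(rng=None):
    """Pick a random direction and distance for the movement request."""
    if rng is None:
        rng = random.Random()
    direction = rng.choice(sorted(DIRECTIONS))
    distance_cm = rng.choice([5, 10, 15])
    return direction, distance_cm

def movement_matches(request, start_x, end_x, px_per_cm, tolerance=0.5):
    """True if the face moved in the requested direction by roughly
    the requested distance (within tolerance as a fraction)."""
    direction, distance_cm = request
    expected_px = DIRECTIONS[direction] * distance_cm * px_per_cm
    moved_px = end_x - start_x
    if moved_px * expected_px <= 0:
        return False  # no movement, or movement the wrong way
    return abs(moved_px - expected_px) <= tolerance * abs(expected_px)

if __name__ == "__main__":
    request = make_request()
    print("Please move your face approximately "
          f"{request[1]} cm to the {request[0]}")
```

A generous tolerance seems appropriate in such a sketch, since
the request only asks for an approximate distance and the
purpose is to rule out video prepared in advance rather than to
measure the movement precisely.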

Next, time-related audio response confirmation is
performed. First, a time-based audio response request is sent
to the relevant student terminals (S200). For example, a
message such as "Please respond verbally with the current time
displayed in the lecturer screen" is displayed on the message
screen. The audio response of each student is detected from
the signal level of the audio being acquired from that student,
and if a sequential audio response is detected (S202), video
and audio are acquired during the audio response (S204). If a
predetermined time elapses before a sequential audio response
is detected (S206), a confirmation message is sent to the
relevant student terminal (S208). If there is a response to
the message, the flow is repeated from S200 (S210), whereas if
there is no response, the flow proceeds to S408 and the
relevant student is marked as absent in the attendance record.

Speech recognition is then performed on the audio
acquired during the audio response (S212). Because, in this
case, the object of the speech recognition is limited to
recognition of a time, a matching process is preferably
performed in syllable units, using a speech dictionary
relating solely to time. Since there are only a limited number
of words to identify, unspecified-speaker speech recognition
using a speech dictionary based on recordings of an
unspecified speaker may be used to recognize the speech, but
specified-speaker speech recognition, in which the student
registers speech relating to the time in advance, and this
pre-registered audio is then used as the speech dictionary,
may also be used.
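
The detection of an audio response from the signal level (S202)
could look roughly like the sketch below. It interprets a
"sequential audio response" as a run of consecutive frames whose
level stays at or above a threshold, which is an assumption
about the intended meaning; the function name
find_audio_response, the per-frame level list, and the
threshold and minimum-length parameters are likewise
hypothetical.

```python
def find_audio_response(levels, threshold, min_frames):
    """Return (start, end) frame indices of the first run of at least
    min_frames consecutive frames with level >= threshold, where end
    is exclusive, or None if no such run exists."""
    run_start = None
    for i, level in enumerate(levels):
        if level >= threshold:
            if run_start is None:
                run_start = i  # a new run begins here
        else:
            if run_start is not None and i - run_start >= min_frames:
                return (run_start, i)
            run_start = None  # run too short or no run; reset
    # Handle a run that lasts until the end of the input.
    if run_start is not None and len(levels) - run_start >= min_frames:
        return (run_start, len(levels))
    return None

if __name__ == "__main__":
    frame_levels = [0, 1, 8, 9, 0, 7, 8, 9, 8, 0]
    print(find_audio_response(frame_levels, threshold=5, min_frames=3))
```

The returned frame range marks the interval for which video and
audio would be retained (S204); a return value of None after the
predetermined time would correspond to the timeout branch at
S206.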

The current time displayed on the lecturer screen is then
acquired (S214), and if the time given by the speech
recognition in the audio response is within a predetermined
margin of error (S216), mouth movement is detected in the
video acquired during the audio response (S218). In this mouth
movement detection, shape parameters are determined for the
mouth by the feature extraction calculation described above,
for those frames of the video which were acquired during the
audio response and which correspond to each syllable
identified by the speech recognition, and these parameters for
each syllable are then compared with mouth shape parameters
obtained from a table that registers the relationship between
syllables and mouth shape parameters, enabling an overall
degree of correlation to be determined. Because the object of
this mouth movement determination is to detect whether the
video of the student subjected to facial image matching is
prerecorded video data, or whether the audio response giving
the time is prerecorded audio data, it is not necessary to
determine exact correspondence, and for example, a simple
method of detecting whether changes occur in the shape of the
mouth of the student in the period where the audio response is
detected may be used. Furthermore, when specified-speaker
speech recognition is performed by having each student
preregister audio relating to the time, it is possible to also
register corresponding video of the student at the same time
as he or she registers the audio, and then make the mouth
movement determination by directly comparing the video of the
student saying each syllable, as recognized by the speech
recognition, with the corresponding pre-registered video of
the student. Furthermore, it is also possible to make the
mouth movement determination by determining and recording the
mouth shape parameters from the video of the student when he
or she registers audio relating to time, and then comparing
these results with the mouth shape parameters determined from
the video of the student acquired when he or she says each
syllable, as recognized by the speech recognition.
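
As a hedged illustration of the margin-of-error check at S216,
the recognized time can be compared with the displayed time as
below. The function name time_within_margin, the (hour, minute)
tuple representation, and the default margin of two minutes are
assumptions; the description only states that a predetermined
margin is used.

```python
def time_within_margin(spoken, displayed, margin_minutes=2):
    """Compare two (hour, minute) times on a 24-hour clock.

    Returns True if they differ by at most margin_minutes, treating
    the clock as circular so that 23:59 and 00:00 are one minute apart.
    """
    def to_minutes(hhmm):
        hour, minute = hhmm
        if not (0 <= hour < 24 and 0 <= minute < 60):
            raise ValueError(f"invalid time: {hhmm}")
        return hour * 60 + minute

    diff = abs(to_minutes(spoken) - to_minutes(displayed))
    diff = min(diff, 24 * 60 - diff)  # wrap around midnight
    return diff <= margin_minutes

if __name__ == "__main__":
    print(time_within_margin((10, 31), (10, 30)))
```

Some margin is needed in any case because the displayed time may
advance while the student is speaking.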

Here, if a determination is made that the mouth movement
of the student corresponds with the audio response, then the
flow proceeds to the next step (S220). If a determination is
made, at S216, that the time given in the audio response is
incorrect, or a determination is made, at S220, that the video
does not correspond with the audio response, then the flow
returns to S208 and a confirmation message is sent to the
relevant student terminal. If there is a response, the flow
repeats from S200.

In the above-described preferred embodiment, a situation
was described in which the time given as an audio response is
subjected to speech recognition and compared to the current
time, but it is also possible to sequentially extract syllable
data corresponding to the current time from a speech
dictionary and compare the data piece by piece with the audio
from the student acquired during the audio response, and then
make a judgment that the time given in the audio response is
correct if the predetermined degree of correlation is obtained
during the entire audio response.

Next, audio response confirmation of personal information
of the student is performed. First, a question item relating
to personal information is selected, and an audio response
request for that personal information question item is then
sent to the relevant student terminals (S300). For example, a
message such as "Please respond verbally with your date of
birth" is displayed on the message screen. This question item
relating to personal information is preferably chosen at
random such that the students cannot predict the question.
The audio response of each student is detected from the signal
level of audio being acquired from that student, and if a
sequential audio response is detected (S302), video and audio
are acquired during the audio response (S304). If a
predetermined time elapses before a sequential audio response
is detected (S306), a confirmation message is sent to the
relevant student terminal (S308). If there is a response to
the message, the flow is repeated from S300 (S310), whereas if
there is no response, the flow proceeds to S408 and the
relevant student is marked as absent in the attendance record.
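
As a non-limiting sketch, the random selection of the question
item (S300) and the detection of a sequential audio response
from the signal level (S302) might look as follows. The list of
question items other than the date of birth, the per-frame level
representation, and the threshold values are assumptions made
for illustration only.

```python
import random

# Hypothetical question items; only "date of birth" appears in the text.
QUESTION_ITEMS = ["date of birth", "home address", "telephone number"]


def select_question(rng=random):
    """Choose a question item at random and build the on-screen message."""
    item = rng.choice(QUESTION_ITEMS)
    return item, f"Please respond verbally with your {item}"


def detect_sequential_response(levels, level_threshold=0.1, min_frames=5):
    """Find the first run of frames whose signal level stays above threshold.

    levels: per-frame audio signal levels (assumed to be normalized).
    Returns (start, end) frame indices with end exclusive, or None when
    no run of at least min_frames frames is found.
    """
    start = None
    for i, level in enumerate(levels):
        if level >= level_threshold:
            if start is None:
                start = i
        else:
            if start is not None and i - start >= min_frames:
                return (start, i)
            start = None
    if start is not None and len(levels) - start >= min_frames:
        return (start, len(levels))
    return None
```

A return value of None after the predetermined time would
correspond to the branch at S306, in which the confirmation
message is sent to the student terminal.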

Next, the audio data corresponding to the question item
relating to personal information of the relevant student is
retrieved from the audio database 55 (S312), and using this
data as a template, matching processing is performed on the
audio acquired during the audio response (S314). As a result,
if a predetermined degree of correspondence is obtained (S316),
mouth movement is detected in the acquired video of the audio
response (S318). In this mouth movement detection, shape
parameters for the mouth are determined by the feature
extraction calculation described above for the frames of the
video acquired during the audio response, and these parameters
are compared with mouth shape parameters obtained from a table
that records the relationship between mouth shape parameters
and characters from the character data of the personal
information of the relevant student recorded in the personal
information database 53, and an overall degree of correlation
is determined. Because the object of this mouth movement
determination, as with the time confirmation method used above,
is to detect whether the video of the student subjected to
facial image matching is prerecorded video data, or whether
the audio response giving the time is prerecorded video data,
it is not necessary to determine exact correspondence, and for
example, a simple method of detecting whether changes occur in
the shape of the mouth of the student in the period when the
audio response is detected may be used. Furthermore, it is
also possible to record video of the student at the same time
as he or she registers the audio of the personal information,
and then make the mouth movement determination by directly
comparing the video acquired during the audio response with
the corresponding pre-registered video of the student.
Furthermore, it is also possible to determine and record the
mouth shape parameters as time series data from the video of
the student acquired when he or she registers the audio of the
personal information, and then make the mouth movement
determination by comparing the mouth shape parameters
determined from the video of the audio response with the
pre-registered mouth shape parameters.
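
The simple method mentioned above, in which it is only detected
whether the shape of the mouth changes in the period when the
audio response is detected, might be sketched as follows. The
representation of the mouth shape parameters as per-frame tuples
(for example, width and height), the function name, and the
threshold value are assumptions made for illustration and do not
reproduce the feature extraction calculation of the embodiment.

```python
def mouth_moves_during_response(mouth_params, response_span, min_change=0.05):
    """Report whether the mouth shape changes while audio is detected.

    mouth_params: per-frame tuples of mouth shape parameters (assumed).
    response_span: (start, end) frame indices of the audio response,
    with end exclusive.
    """
    start, end = response_span
    frames = mouth_params[start:end]
    if len(frames) < 2:
        return False
    # Largest parameter change between each pair of consecutive frames.
    changes = [
        max(abs(a - b) for a, b in zip(prev, cur))
        for prev, cur in zip(frames, frames[1:])
    ]
    # Mean frame-to-frame change compared against the threshold.
    return sum(changes) / len(changes) >= min_change
```

Because exact correspondence is not required, a coarse measure
of this kind may be sufficient to distinguish a live response
from a still or unrelated prerecorded image.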

Here, if a determination is made that the movement of the
mouth of the student corresponds to the audio response, the
flow proceeds to the next step (S320). If a determination is
made, at step S316, that the personal information is incorrect,
or at step S320, that the video does not correspond to the
audio response, the flow proceeds to S308, and a confirmation
message is sent to the relevant student terminal. If there is
a response to the message, the flow is repeated from S300.

After the attendance confirmation processing is completed
for each student, a notification of attendance confirmation
completion is sent to the relevant student terminal (S400),
and a notification is sent to the bidirectional communication
server 20, which terminates video and audio acquisition for
that student (S402).

After attendance confirmation is completed for all
students (S404), the flow then repeats from the start (S100)
until the lecture ends.

In the description of the above preferred embodiment,
facial image matching, facial movement confirmation, time
related audio response confirmation, and personal information
related audio response confirmation were all performed at each
attendance confirmation time, but the present invention is not
limited to this configuration, and any one of a combination of
facial image matching and facial movement confirmation, a
combination of facial image matching and time related audio
response confirmation, and a combination of facial image
matching and personal information related audio response
confirmation may be implemented, or any of these may be chosen
and implemented at random at the attendance confirmation time.
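
For illustration only, the random choice among the three
combinations at each attendance confirmation time might be
sketched as follows. The identifiers used for the individual
confirmations are hypothetical labels and do not correspond to
any named component of the embodiment.

```python
import random

# Hypothetical labels for the confirmations paired with facial image matching.
SECONDARY_CHECKS = (
    "facial_movement",
    "time_audio_response",
    "personal_info_audio_response",
)


def choose_checks(rng=random):
    """Return facial image matching plus one randomly chosen confirmation."""
    return ("facial_image_matching", rng.choice(SECONDARY_CHECKS))
```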

Furthermore, in the description of the above preferred
embodiment, the actions requested of the student involved
moving his or her face or providing an audio response, but the
present invention is not limited to these actions, and any
action that can be recognized in terms of changes in the video,
such as rotating the head, closing the eyes, raising a hand,
showing an object with a specific shape to the camera, or
showing a piece of paper with a specific shape drawn thereon,
can be used to achieve the effect of the present invention.

In the description of the above preferred embodiment,
after facial image matching is performed, the actions for
facial movement confirmation, time related audio response
confirmation, and personal information related audio response
confirmation are requested, and a determination as to whether
the legitimate student is in attendance is made by determining
whether the image changes at that time correspond to the
request. However, it is also possible to provide a
function for detecting the position of the face of the student
continuously and confirm that the position of the face does
not change in a discontinuous manner. As a result, if a
substitute attendee, upon receiving an action request,
switches to a pre-recorded video corresponding to the action
request, then the position of the face of the student will
move in a discontinuous manner, making it possible to
detect the third person attendance, and the attendance of the
legitimate student can therefore be confirmed with even
greater reliability.
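
A minimal sketch of such a continuity check is given below,
assuming that a face detection function (not shown) supplies the
center of the face in pixel coordinates for every frame. The
function name and the maximum permitted jump of 40 pixels
between consecutive frames are hypothetical values chosen for
illustration.

```python
import math


def find_discontinuity(positions, max_jump=40.0):
    """Return the index of the first frame where the face position jumps.

    positions: per-frame (x, y) face centers in pixels (assumed input).
    Returns None when the face moves continuously throughout.
    """
    for i in range(1, len(positions)):
        (x0, y0), (x1, y1) = positions[i - 1], positions[i]
        # A displacement larger than max_jump between consecutive frames
        # is treated as a discontinuous change of position.
        if math.hypot(x1 - x0, y1 - y0) > max_jump:
            return i
    return None
```

A frame index returned by this check would indicate the moment
at which the video source may have been switched, for example
to a pre-recorded video.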

In the description of the above preferred embodiment,
every student attends the lecture from a student terminal, and
no students attend the lecture of the lecturer in person, but
it is also possible to set up a lecture which mixes both
on-site students and remote students, where lectures are
conducted in a lecture site equipped with a large screen
display device for displaying video sent from the
bidirectional communication server 20, and loudspeakers for
outputting the audio sent from the bidirectional communication
server 20.

Furthermore, it is also possible to provide satellite
lecture sites in various locations, with large screen display
devices and loudspeakers provided therein, and treat the
students gathered in the satellite lecture sites as a single
gathering of students.

As a result, students who are capable of traveling to the
lecture site or the satellite lecture site can attend in
person, and students who cannot travel can attend remotely,
and as such, it is possible for a greater number of students
to simultaneously attend a lecture by a single lecturer. In
this case, it goes without saying that terminals which have an
input function for answering questions, and video input and
audio input functions for attendance confirmation may also be
provided for the students at the lecture site and the students
at the satellite lecture sites.

In the description of the above preferred embodiment, the
remote education system includes a content
distribution server, and the educational materials used in the
lecture are distributed to each student terminal by the
content distribution server. However, the present
invention is not limited to this configuration, and it is
possible to simply run educational software on the screen or
the like of the lecturer terminal, and then capture this
software using the camera which acquires video of the lecturer.
Furthermore, it is also possible to provide external video and
audio input terminals on the lecturer terminal, and connect a
VTR or DVD or the like to the terminals, and then either
combine the video and audio of the educational software played
by the lecturer with video and audio of the lecturer, or
switch between the two, and send the results to the
bidirectional communication server, which then distributes the
data to each student terminal.



In the description of the above preferred embodiment, the
lecturer terminal is connected to an internal network of the
remote education system. However, the present invention
is not limited to this configuration, and the lecturer
terminal may be connected to the remote education system via
the Internet. As a result, the lecturer can conduct a lecture
from anywhere as long as he or she has a lecturer terminal
capable of connecting to the Internet, which enables people
living overseas to be invited to be a lecturer, and also
simplifies scheduling and reduces personnel expenses.

In the description of the above preferred embodiment, the
remote education system includes a Web server, and
the student terminals perform tasks relating to attendance by
accessing the Web server using a computer with a Web browser
function. However, the present invention is not limited
to this configuration, and by using a normal videophone as the
student terminal, transmission of messages from the operations
management system to the student terminals can be
achieved by a function for incorporating message
images in the bidirectional communication server and sending
the result to the student terminal, and the input of commands
and messages to the operations management system from the
student terminal can be achieved by providing a
function for using the dialing buttons of the user terminal,
and having the bidirectional communication server detect the
PB tone signals. As a result, a telephone-type videophone
terminal that connects to a public telephone line, such as an
ISDN videophone in accordance with the ITU-T recommended H.320
protocol or an IP (Internet Protocol) type videophone terminal
that connects to the Internet, such as an IP videophone
terminal in accordance with the ITU-T recommended H.323
protocol, can be used. In this case, video codecs and audio
codecs corresponding to the protocol of the student terminals
must be provided on the bidirectional communication server,
but by providing a gateway which performs protocol conversion,
it is possible to have a mixture of student terminals based on
various protocols.
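
As a hedged illustration of how a server might detect the PB
tone signals produced by the dialing buttons, the following
sketch assumes that the PB tones are standard DTMF tones, that
decoded audio samples are available at 8000 Hz, and that the
well-known Goertzel recurrence is used to measure tone power.
None of these choices is stated in the embodiment, and the
function names are hypothetical. A practical detector would
also need level thresholds to reject silence and speech.

```python
import math

# Standard DTMF layout: each key is one row tone plus one column tone (Hz).
ROW_FREQS = (697, 770, 852, 941)
COL_FREQS = (1209, 1336, 1477, 1633)
KEYPAD = ("123A", "456B", "789C", "*0#D")


def goertzel_power(samples, freq, sample_rate):
    """Return the signal power near freq using the Goertzel recurrence."""
    coeff = 2.0 * math.cos(2.0 * math.pi * freq / sample_rate)
    s_prev, s_prev2 = 0.0, 0.0
    for x in samples:
        s = x + coeff * s_prev - s_prev2
        s_prev2, s_prev = s_prev, s
    return s_prev * s_prev + s_prev2 * s_prev2 - coeff * s_prev * s_prev2


def detect_key(samples, sample_rate=8000):
    """Pick the strongest row tone and column tone and map them to a key."""
    row = max(
        range(4), key=lambda i: goertzel_power(samples, ROW_FREQS[i], sample_rate)
    )
    col = max(
        range(4), key=lambda i: goertzel_power(samples, COL_FREQS[i], sample_rate)
    )
    return KEYPAD[row][col]


def synth_key(key, sample_rate=8000, n_samples=400):
    """Generate a two-tone test signal for key (helper for testing only)."""
    row = next(r for r, keys in enumerate(KEYPAD) if key in keys)
    col = KEYPAD[row].index(key)
    f_low, f_high = ROW_FREQS[row], COL_FREQS[col]
    return [
        math.sin(2.0 * math.pi * f_low * t / sample_rate)
        + math.sin(2.0 * math.pi * f_high * t / sample_rate)
        for t in range(n_samples)
    ]
```

For example, passing the output of synth_key("5") to detect_key
is expected to yield "5"; the detected key could then be
forwarded to the operations management system as a command.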

INDUSTRIAL APPLICABILITY

As described above, the present invention prevents
substitute attendance by a third person using prerecorded
video and the like, and enables confirmation that the
legitimate student is in attendance.

While preferred embodiments of the invention have been
described above, it is to be understood that variations and
modifications will be apparent to those skilled in the art
without departing from the scope and spirit of the invention.
The scope of the invention, therefore, is to be determined
solely by the following claims.